Unvisited URL Relevancy Calculation in Focused Crawling Based on Naïve Bayesian Classification
نویسندگان
چکیده
منابع مشابه
Unvisited URL Relevancy Calculation in Focused Crawling Based on Naïve Bayesian Classification
Vertical search engines use focused crawler as their key component and develop some specific algorithms to select web pages relevant to some pre-defined set of topics. Crawlers are software which can traverse the internet and retrieve web pages by hyperlinks. The focused crawler of a special-purpose search engine aims to selectively seek out pages that are relevant to a pre-defined set of topic...
متن کاملIncremental Naïve Bayesian Learning Algorithm based on Classification Contribution Degree
In order to improve the ability of gradual learning on the training set gotten in batches of Naive Bayesian classifier, an incremental Naïve Bayesian learning algorithm is improved with the research on the existing incremental Naïve Bayesian learning algorithms. Aiming at the problems with the existing incremental amending sample selection strategy, the paper introduced the concept of sample Cl...
متن کاملFocused Crawling System based on Improved LSI
In this research work we have developed a semi-deterministic algorithm and a scoring system that takes advantage of the Latent Semantic indexing scoring system for crawling web pages that belong to particular domain or is specific to the topic .The proposed algorithm calculates a preference factor in addition to the LSI score to determine which web page needs to preferred for crawling by the mu...
متن کاملEfficient Crawling Through URL Ordering
In this paper we study in what order a crawler should visit the URLs it has seen, in order to obtain more "important" pages first. Obtaining important pages rapidly can be very useful when a crawler cannot visit the entire Web in a reasonable amount of time. We define several importance metrics, ordering schemes, and performance evaluation measures for this problem. We also experimentally evalu...
متن کاملACNB: Associative Classification Mining Based on Naïve Bayesian Method
Integrating association rule discovery and classification in data mining brings a new approach known as associative classification. Associative classification is a promising approach that often constructs more accurate classification models (classifiers) than the traditional classification approaches such as decision trees and rule induction. In this research, the authors investigate the use of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Applications
سال: 2010
ISSN: 0975-8887
DOI: 10.5120/767-1074